AITopics | end-to-end optimization

Collaborating Authors

end-to-end optimization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Information-driven design of imaging systems

AIHubMar-23-2026, 09:22:05 GMT

Our information estimator uses only these noisy measurements and a noise model to quantify how well measurements distinguish objects. Many imaging systems produce measurements that humans never see or cannot interpret directly. Your smartphone processes raw sensor data through algorithms before producing the final photo. MRI scanners collect frequency-space measurements that require reconstruction before doctors can view them. Self-driving cars process camera and LiDAR data directly with neural networks.

information, machine learning, natural language, (19 more...)

AIHub

Country:

North America > United States > Oregon (0.05)
Europe > Netherlands > North Holland > Amsterdam (0.05)
Asia > Singapore (0.05)

Industry: Health & Medicine (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.71)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.55)
Information Technology > Communications > Social Media (0.49)
(2 more...)

Add feedback

When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking

Neural Information Processing SystemsDec-23-2025, 21:58:44 GMT

Multipartite ranking is a basic task in machine learning, where the Area Under the receiver operating characteristics Curve (AUC) is generally applied as the evaluation metric. Despite that AUC reflects the overall performance of the model, it is inconsistent with the expected performance in some application scenarios, where only a low False Positive Rate (FPR) is meaningful. To leverage high performance under low FPRs, we consider an alternative metric for multipartite ranking evaluating the True Positive Rate (TPR) at a given FPR, denoted as TPR@FPR. Unfortunately, the key challenge of direct TPR@FPR optimization is two-fold: \textbf{a)} the original objective function is not differentiable, making gradient backpropagation impossible; \textbf{b)} the loss function could not be written as a sum of independent instance-wise terms, making mini-batch based optimization infeasible. To address these issues, we propose a novel framework on top of the deep learning framework named \textit{Cross-Batch Approximation for Multipartite Ranking (CBA-MR)}. In face of \textbf{a)}, we propose a differentiable surrogate optimization problem where the instances having a short-time effect on FPR are rendered with different weights based on the random walk hypothesis. To tackle \textbf{b)}, we propose a fast ranking estimation method, where the full-batch loss evaluation is replaced by a delayed update scheme with the help of an embedding cache. Finally, experimental results on four real-world benchmarks are provided to demonstrate the effectiveness of the proposed method.

end-to-end optimization, false positive, multipartite ranking, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking

Neural Information Processing SystemsOct-9-2024, 20:34:15 GMT

end-to-end optimization, fpr, multipartite ranking, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization

Zamani, Hamed, Bendersky, Michael

arXiv.org Artificial IntelligenceMay-5-2024

This paper introduces Stochastic RAG--a novel approach for end-to-end optimization of retrieval-augmented generation (RAG) models that relaxes the simplifying assumptions of marginalization and document independence, made in most prior work. Stochastic RAG casts the retrieval process in RAG as a stochastic sampling without replacement process. Through this formulation, we employ straight-through Gumbel-top-k that provides a differentiable approximation for sampling without replacement and enables effective end-to-end optimization for RAG. We conduct extensive experiments on seven diverse datasets on a wide range of tasks, from open-domain question answering to fact verification to slot-filling for relation extraction and to dialogue systems. By applying this optimization method to a recent and effective RAG model, we advance state-of-the-art results on six out of seven datasets.

computational linguistic, dataset, proceedings, (13 more...)

arXiv.org Artificial Intelligence

2405.02816

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
North America > United States > District of Columbia > Washington (0.05)
North America > United States > New York > New York County > New York City (0.05)
(13 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.88)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.71)
(2 more...)

Add feedback

The shift from models to compound AI systems

AIHubMar-15-2024, 09:00:00 GMT

AI caught everyone's attention in 2023 with Large Language Models (LLMs) that can be instructed to perform general tasks, such as translation or coding, just by prompting. This naturally led to an intense focus on models as the primary ingredient in AI application development, with everyone wondering what capabilities new LLMs will bring. As more developers begin to build using LLMs, however, we believe that this focus is rapidly changing: state-of-the-art AI results are increasingly obtained by compound systems with multiple components, not just monolithic models. For example, Google's AlphaCode 2 set state-of-the-art results in programming through a carefully engineered system that uses LLMs to generate up to 1 million possible solutions for a task and then filter down the set. AlphaGeometry, likewise, combines an LLM with a traditional symbolic solver to tackle olympiad problems.

application, compound ai system, compound system, (16 more...)

AIHub

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Add feedback

Deep learning for ECoG brain-computer interface: end-to-end vs. hand-crafted features

Śliwowski, Maciej, Martin, Matthieu, Souloumiac, Antoine, Blanchart, Pierre, Aksenova, Tetiana

arXiv.org Artificial IntelligenceOct-12-2022

In brain signal processing, deep learning (DL) models have become commonly used. However, the performance gain from using end-to-end DL models compared to conventional ML approaches is usually significant but moderate, typically at the cost of increased computational load and deteriorated explainability. The core idea behind deep learning approaches is scaling the performance with bigger datasets. However, brain signals are temporal data with a low signal-to-noise ratio, uncertain labels, and nonstationary data in time. Those factors may influence the training process and slow down the models' performance improvement. These factors' influence may differ for end-to-end DL model and one using hand-crafted features. As not studied before, this paper compares models that use raw ECoG signal and time-frequency features for BCI motor imagery decoding. We investigate whether the current dataset size is a stronger limitation for any models. Finally, obtained filters were compared to identify differences between hand-crafted features and optimized with backpropagation. To compare the effectiveness of both strategies, we used a multilayer perceptron and a mix of convolutional and LSTM layers that were already proved effective in this task. The analysis was performed on the long-term clinical trial database (almost 600 minutes of recordings) of a tetraplegic patient executing motor imagery tasks for 3D hand translation. For a given dataset, the results showed that end-to-end training might not be significantly better than the hand-crafted features-based model. The performance gap is reduced with bigger datasets, but considering the increased computational load, end-to-end training may not be profitable for this application.

artificial intelligence, freq max, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2210.02544

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.05)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.88)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Amazon Researchers Designed A New Machine Learning Algorithm Based On Entropy Balancing That Learns Weights To Directly Maximize Causal Inference Accuracy Using End-To-End Optimization

#artificialintelligenceAug-3-2022, 12:52:12 GMT

A causal effect means that a certain thing is happening based on something that has already occurred. In business, the causal effect of a treatment is very important, for example, changing the font of a page based on the amount of time spent by a user. Treatments can either be binary or can be continuous. Confounding factors: it's the third variable while examining a cause and effect relationship. Usually, there exist confounding factors that influence the treatment as well as response relationship and causal estimation accounts for them.

amazon researcher designed, maximize causal inference accuracy, new machine learning algorithm, (5 more...)

#artificialintelligence

Country: Asia > India > West Bengal > Kharagpur (0.06)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Artificial Intelligence Improves Control of Prosthetic Hands

#artificialintelligenceOct-2-2019, 13:12:30 GMT

Scientists from the University of Texas at Dallas announced a groundbreaking new approach for improving control of prosthetics with the use of artificial intelligence (AI) at the 2019 IEEE International Symposium on Measurement and Control in Robotics Symposium this month. The research findings show a huge leap forward in the goal of fully end-to-end optimization of electromyography (EMG) controlled prosthetic hands. There are more than 40 million amputees across the globe, according to the World Health Organization. Recent advances in prosthetic hand and limb technology have greatly improved the quality of life for upper-limb amputees. However, gaps remain in the control of prosthetic hands, specifically in using naturally generated electric signals from the patient's muscles.

mohsen jafarzadeh, prosthetic hand, university, (8 more...)

#artificialintelligence

Country: North America > United States > Texas (0.32)

Genre:

Press Release (1.00)
Research Report > New Finding (0.53)

Industry:

Health & Medicine > Therapeutic Area > Orthopedics/Orthopedic Surgery (1.00)
Health & Medicine > Health Care Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.41)

Add feedback